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Non asymptotic minimax rates of testing in signal detection with 

heterogeneous variances 
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Abstract 



The aim of this paper is to estabhsh non-asymptotic minimax rates for goodness-of-fit hy- 
potheses testing in an heteroscedastic setting. More precisely, we deal with sequences (Yj)jgj 
of independent Gaussian random variables, having mean (9j)j^j and variance {aj)j^j. The 
set J will be either finite or countable. In particular, such a model covers the inverse problem 
setting where few results in test theory have been obtained. The rates of testing are obtained 
• with respect to I2 and l^o norms, without assumption on {o'j)jeJ ^-nd on several functions 

I spaces. Our point of view is entirely non- asymptotic. 
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^ ■ 1 Introduction 

CN ' We consider the following heteroscedastic statistical model : 

Yj=0j + ^j^j^ J e J, (1.1) 

a^ " 

O ■ where 9 = {9j)j(zj is unknown, a = {(Tj)j^j is assumed to be known, and the variables {ej)j(zj 

are i.i.d. standard normal variables. The set J is either {1, . . . ,N} for some iV G N* (which 
\ corresponds to a Gaussian regression model) or N* (which corresponds to the Gaussian sequence 

^ ■ model). The sequence 6 has to be tested from the observations {Yj)j^ j in order to decide whether 

^ ■ = 0" or not. The particular case aj = a for all j £ J corresponds to the classical statistical 

model where the variance of the observations is always the same. It has been widely considered 
in the literature, both for test and estimation approaches. In this paper, we consider a slightly 
different setting in the sense that the variance of the sequence is allowed to depend on j. 

We point out that the model (jl.ip can describe inverse problems. Indeed, for a linear operator 
T on an Hilbert space H with inner product (.,.), consider an unknown function / indirectly 
observed in a Gaussian white noise model 

Y{g) = {Tf,g) + ae{g), g(^H (1.2) 

where e{g) is a centered Gaussian variable with variance \\g\\'^ := {g,g)- If T is assumed to be 
compact, it admits a singular value decomposition (SVD) {bj,ipj,(l)j)j>i in the sense that 

T(pj = bjTpj, T*ipj = hj(t)j, 
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with r* the adjoint operator of T. Hence considering the observations Y{ipj), model (|1.2p 
becomes 

= bjOj + aej, j G N*, (1.3) 

with ej = €{'ipj), (Tf,ipj) = bjOj and 9j = {f,(pj). This model is often considered in the inverse 
problem literature, see eg j7]. Setting Yj = bJ^Zj and aj = o-bj^ for all j G N*, we obtain 
(jl.ip . Hence inference on observations from model (jl.ip provides the same results for inverse 
problems. We stress that if estimation issues for inverse problem have been well studied over 
the past years (see for instance [H], [7] or [18] for a model selection approach), tests for inverse 
problems have either only been tackled, in the general case, by very few authors among them we 
refer to [5], [12], [16] or have been investigated only for the very specific case of the convolution 
problem, see in [6] and references therein. 

For all 9 £ hi-J), we set \\9\\l = and for all 9 G looiJ), \\9\\oo = sup^gj |6'j|. The 

purpose of this paper is to provide rates of testing for the hypothesis "9 = 0" against the 
alternative "H^Hg > p" where g = 2 or g = oo . More precisely, given a,/3 g]0, 1[, a level a 
test of the null hypothesis "9 = 0", and a class of vectors J- C lq{J), we define the uniform 
separation rate pq{^a,J^, f3) of the test over the class with respect to the Iq norm as the 
smallest radius p such that the test guarantees a power greater that 1-/3 for all alternatives 
9 G F such that \\9\\q > p. More formally 

/9g(^>a, J",/3) = inf <^ p > 0, inf P0($a rejects )> 1 - /3 I . 

Let us now define Pq{T, a, 13) as the infimum over all level a test of the quantity pq{^a,J^, P)- 
This quantity will be called the (a, /3) minimax rate of testing over the class J-. The aim of the 
paper is to determine this minimax rate of testing over various classes of alternatives J-", for the 
test of null hypothesis "9 = 0" in Model (jl.ip with respect to the I2 and loo norms. 

The main reference for computing minimax rates of testing over non parametric alternatives 
is the series of paper due to Ingster[TT]. where various statistical models and a wide range of sets 
of alternatives are considered. Lepski and Spokoiny |17j obtained minimax rates of testing over 
Besov bodies Bs,p,q{R) in the irregular case (when < p < 2), see also Ingster and Suslina [13]. 
Ermakov [10] determines a family of asymptotic minimax tests for testing that the signal belongs 
to a parametric set against nonparametric sets of alternatives in the heteroscedastic Gaussian 
white noise. In all these references, asymptotic minimax rates of testing are established. In 
Model (jl.ip . with aj = a for all j £ J, Baraud [2j consider a non asymptotic point of view, 
which means that the noise level a is not assumed to converge towards 0. This is the point of 
view that we adopt in this paper. We give a precise expression of the dependency of the minimax 
rates of testing with respect to the sequence {aj)j^j. The particular cases of interest correspond 
to polynomial and exponentially increasing sequences, which in the case of Model (jl.3p leads to 
the so-called mildly and severely ill-posed inverse problems. We do not aim at providing the 
adaptive minimax rates of testing, which will be the core of a future work. 

The paper is organised as follows. In Section 2, we provide lower bounds for the minimax 
separation rate over classes of vectors 9 with a finite number of non-zero coefficients, which yet 
covers sparse signals. In Section 3, we determine upper bounds for those minimax rates. In 
Section 4, we compute minimax rates of testing over ellipsoids and Ip balls. The proofs are 
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gathered in Sections 5 and 6. 

To end this introduction, let us define some notations. Let Y = {Yj)j^j obey to Model p.ip . 
We denote by 6 the vector (or sequence) {6j)j^j and by ¥q the distribution of Y . All along the 
paper, we consider the test of null hypothesis "9 = 0". Let a g]0, 1[ be some prescribed level. A 
test function $q is a measurable function of the observation y, with values in {0, 1}. The null 
hypothesis is accepted if $q, = and rejected if <I>q, = 1. Finally, for all x G M, we denote by [xj 
the greater integer smaller than x and we set \x\ = \x\ +1. 

2 Lower bounds 

The bounds will be established for two classes of signals characterized by their non zeros coef- 
ficients. The first one deals with the elementary case where the coefficients are equal to zero 
after a certain rank. The second one concerns the so-called sparse signals which are defined by 
the amount of non zeroes coefficients which can be located at different scales. 

2.1 Lower bounds in I2 norm 

In this section, we generalize the results obtained by Baraud [2] in an homoscedastic model to 
the heteroscedastic Model (jl.ip . 

We first give a lower bound for the minimax rate of testing over the set 5^), defined for all 
L» > 1 by 

SD = {eei2{J),^j>D,ej = Q}. 

When J = {1, . . . , N}, we assume that D < N . 

Proposition 1 Assume that Y = {Yj)j,zj obeys to Model Let (3 G]0, 1 — a[, c(a,/3) = 

(21n(l+4(l-a-/3)2))^/^ and 

pI) = c{a,(3) I I 

The following result holds: 

yp<pD, inf sup Pe($« = 0) > /3. 

eeSDM\2=p 

This implies that the minimax rate of signal detection over Sd with respect to the I2 norm 
satisfies 

P2{SD,a,/3) > PD- 

This proposition can be understood as follows: whatever the a- level test chosen, for all p < pu, 
there exists a signal 9 E Sd with norm p such that the error of the second kind is greater than 
(3. The results obtained in Proposition [1] coincide with the lower bound established by Baraud 
p] in the homoscedastic model {aj = a £ J). 
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Let us now consider the problem of sparse signal detection. Let fc, n G N* with k < n. When 
J = {1, . . . ,N}, we assume that n < N. We want to obtain lower bounds for the minimax 
separation rate of signal detection over the set Sk,n defined by 



{e G hiJ), Vj > n, Oj = 0, Card {j < n, Oj / 0} < A;} 



■'Lk 



l+k 

E 

j=i+i 



or 



Let /3 g]0, 1 - a[, such that a + P < 59%. Let 



2 

Pk,n 



max ^ In 1 + 



0<Kn-fc 



n — I 



V 



n — I 



1/2" 



V 



ti) 



\j=n—k+l 



For all level a test 



inf Pe($, = 1) < 1 - /3. 
^'6'Sfe,„,||e||2>Pfc,„ 



(2.4) 



Theorem 1 Assume that Y = {Yj)j,zj obeys to Model M.l\j . Let (T(i) < a (2) < . . . < we 
define for all I G {0, . . . , n — k}, 



(2.5) 



(2.6) 



This implies that the minimax rate of signal detection over Sk,n with respect to the I2 norm 
satisfies 

Comments : Let us consider three cases governing the behaviour of the sequence {aj)j£j. 

1. In the homoscedastic case, aj = a for all j G J. In this case, = a'^k for all / and, 
taking I = 0, we obtain that 



Tl Tl 

pl^>a^kln{l + ^vJ^ 



which corresponds to the lower bound established by Baraud [2]. 



2. When k < n/2 and ^fn/2i k — ^'^n-k k some absolute constant C (independent of k 
and n), we obtain that 



Pk,n > 



C^n~k,k^^ 



(^ + 2F^/5)m t 4)1 

^ ' ^ \j=n-k+l I 



1/2 



(2.7) 



At the price of a factor 2 in the logarithm (n is replaced by n/2), the variance term 
appearing in the lower bound for ^ is E^_^ ^ which corresponds to the largest possible 
variance for a set of cardinality k in {1, . . . , n}, indeed 



v2 _ 
^n-k,k — 
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where A4k,n denotes the set of all subsets of {1, . . . , n} with cardinality k. 

This situation occurs for example when {aj)j^j grows at a polynomial rate : cjj = j'^'^ for 

some 7 > 0. In this case, 

^n-k,k — '^^ 1 ^n/2,k — ^2/ 

3. When {aj)j(zj grows at an exponential rate : aj = exp(7j) for some 7 > 0, we obtain that 

pin > ^(n) 

2.2 Lower bounds in l^o norm 

In this section, we provide lower bounds for the minimax rate of signal detection in Model (ll.ip 
with respect to the loo norm. These bounds can be derived from the Theorem [TJ Indeed, let 
Sk^n be defined by ()2.4p . For 6 G Si^n, \\(^\\2 = Halloo- Hence, denoting by pooiJ^,a, /3) the (a, /S) 
minimax rate of testing over the class with respect to the loo norm, we obtain that 

Poo{<Si,n,a,(3) = p2(5i,„,a,/3). 

By definition of the (a, /3) minimax rate of testing it is obvious that if 5 C S' , then pooi<S) < 
Poo (5')- Si^n C Sk,n for all 1 < A; < n. This implies that for all 1 < k < n, 

Poo{<Sk,n,a,f3) > p2(5i,„,a,/3). 

This leads to the following corollary of Theorem [1] : 

Corollary 1 Assume that Y = {Yj)j^j obeys to Model Let (3 G]0, 1 — a[, such that 

a + p< 59%. Let 

Sk,n = {0 e hiJ), Vj > n, Oj = 0, Card {j < n, Oj / 0} < fc} . 
Let (T(i) < (T(2) < • • • < cr(n) ■ define 

Pn,oo= max fT(/+i)A/ln(l + n-0. (2.8) 
The following result holds : 

\/l<k<n, pooiSk,n,a, P) > 

The proof of the corollary follows directly from the arguments given above and therefore will be 

omitted. 

Comments : 

1. Let Sn = {0 £ l2{J),yj > n, 9j = 0} . Note that 5„ = Sn,n, hence Vn > 1, Poo{Sn, ct, f3) > 

Pn,oo ■ 

2. When aj = a for all 1 < j < n, we obtain Pn,oo = cr y^ln (n + 1). 

3. When {aj)j^j grows at a polynomial rate : aj = p for some 7 > 0, we obtain that 
Pn,oo > C{-i)rC<^jMn) by taking I = [n/2j in 

4. When {aj)j£j grows at an exponential rate : aj = exp{j'y) for some 7 > 0, we obtain that 
Pn,oo > C exp(n7) for some constant C. 
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3 Upper bounds 



In this section, we give upper bounds for the (a, /3) minimax rates of testing over the sets Sd 
and Sk,n that we compare with the lower bounds obtained in the previous section. 
In order to show that the (a, /3) minimax rate of testing with respect to the Iq norm over a set 
is bounded from above by p, it suffices to define a test statistic such that the power of the 
test at each point 9 in satisfying \\9\\q > p is greater than 1 — /3. 



tD,i-a{<^) denote the 1 — a quantile ofJ2f=i^'j^'j ' 



Proposition 2 Assume that Y = {Yj)j^j obeys to Model Let a,f3 e]0, 1[, and let 

2 2 

^cTjej > tD,i-a{(T) j = a. 

Ij:-,^.>,,,_.h- (3.9) 



Let the test defined by 

Then, $q, is a level-a test : 



Pe=o(«'a = 1) = a. 

Moreover, there exists an absolute constant C such that for all 9 G Sd, setting 
C{a, 13) = C(log(/3-i) + log(a-i) + V^og{a-') + V^ogiP'')) 

\\9\\l>C{a,(3) lE^l ^Pf?(^'a = l) > 1-/3. 

Hence, we obtain that 

pl{SD,a,P)<C{a,P) 



4 

o 




Note that this bound coincides to the upper bound found in Proposition [TJ Hence, it proves 
that this lower bound is sharp. 



Let us now propose a testing procedure for sparse signal detection. This procedure will be 
defined by a combination of two tests. The first one is based on a thresholding method, which 
was already used for detection of irregular alternatives in Baraud et al [3] and in Fromont et al 
[S]. The second one is the test considered in Proposition [2] with D = n, which will be powerful 
when k is larger that ^/n. 

Theorem 2 Assume that Y = {Yj)j^j obeys to Model U.l\) . Let a, 13 £]0,1[, and let in.i-aCf) 
denote the 1 — a quantile of "^j^^ a'j . Let be the test defined by 
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Let qn,i-a denote the 1 — a quantile 0/ maxi<j<„ e^. Let be the test defined by 



$(2) 



1 if max ^ > qn,i-a 
otherwise. 



We define = max (^^^^^2^ ^^a/s) ' ^ level-a test : 

Pe=o(^a = 1) = a. 
There exists a constant C{a,l3) such that for all 9 G Sk,n satisfying 

1/2 

>C{aJJ) " 



A 



we have 

Hence, we obtain that 



Pe($„ = l)>l-/3. 



pl{Sk,n,a,f3)<Cia,P) 



1/2 



AS 



n—k 



fcln(n) 



(3.10) 



(3.11) 



where /las 6een defined in h2.5\) . 

Comments : Let us compare these results with the lower bounds obtained in Theorem [TJ 



1. We first assume that (crj)jgj grows at a polynomial rate : Vj G J, aj = ap for some 7 > 
(this includes the homoscedastic case). In this case, when k <n/2 there exists a constant 
C > such that Y?^^i^^ ^ > CT,'^_i^ ^. A lower bound for the (a, /3) minimax separation 
rate of signal detection over is given by (j2.7p . This lower bound has to be compared 
with the upper bound p. lip . 



• When k = n with / < 1/2, the upper and lower bounds coincide and are of order 

• When k = with / > 1/2, the lower bound is of order Ti^_|^^^/n/k and S2_^ ^ > 
Cka'^n'^^, which leads to a lower bound of order C(T^n2'>'+^/2. The upper bound is 

smaller that ^X]j=i ^j) > which is smaller than a'^n'^^'^^^'^ . Hence, the two bounds 
coincide. 

• When k = y/n/(j){n) where 0(n) — t- +00 and (j){n)/n — t- as n — t- +c« (typically 
4){n) = ln(n)), the lower bound is of order ^ ln(0(n)) and the upper bound is of 
order E^_^^ln(n). In this case, the upper and lower bound do not coincide, up to a 
logarithmic term. 
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2. Let us now assume that {aj)j^j grows at an exponential rate : Vj G J, aj = aex.p{'yj) for 
some 7 > 0. The lower bound is greater than = a"^ ex.p{2'yn) and the upper bound is 

smaller that C{a, (3) i X]j=i ) ; which is bounded from above by C(a, /3, 7)0"^ exp(27n) . 

(2) 

Hence the two bounds coincide. Note that in this case, the test based on thresholding 
is useless and one can simply consider that test 

* = *(^) 

which achieves the lower bound for the separation rate. 

3. The result stated in (j3.10p is more precise than the minimax upper bound given in (jS.lip . 
If the set Ji = {j, 6j 7^ 0} corresponds to small values for the variances {aj)j^j-^, it is not 
required that H^lH is greater than the right hand term in (j3.1ip for the test to be powerful 
for this value of 9. The minimax bound given in (jS.lip corresponds to the worst situation, 
that is the case where the set Ji corresponds to the largest values for the variances. 

Let us now present an upper bound for the minimax separation rate with respect to the l^o 
norm. 

Corollary 2 Assume that Y = {Yj)j^j obeys to Model Let u^i) < ... < '^(n)- -^^^ 

a,f3 s]0, 1[. Let be the test defined in Theorem\M There exists a constant C{a,/3) such that 
for all k £ {1, . . . ,n} , for all 9 G S^^n such that 



>C{a,f3) 



^ in) 



V 



we have 

Pe($a = 1) > 1 - /3. 

This implies that for a// /c G {1, . . . , n}, 

In \ 



Pooi<Sk,n,a,(3) < C{a,(3) 



o-(n)\/ln(n) A 



v 



This upper bound coincides with the lower bound obtained in Corollary [J when the sequence 
(cjj)jgj is constant or grows at a polynomial or at an exponential rate. 



4 Minimax rates over ellipsoids and Ip balls 

In the previous sections, the only constraint on the signal was expressed through the number 
of non-zero coefficients. In several situations, one deals instead with infinite sequences having a 
finite number of significant coefficients, the reminder being considered as negligible (in a sense 
which will be precised later on). To this end, we consider in this section a slightly different 
framework. Our aim is to study the link between the decay of the ^^'s and the associated rate 
of testing. We consider in the following two different kinds of function spaces: ellipsoids and 
L-bodies. 
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4.1 Minimax rates of testing over ellipsoids 

In the following, we assume that the sequence 9 = {Oj)j(zj belongs to the ellipsoid £a,2{R) defined 
as 

where a = {ak)k&j denotes a monotone non-decreasing sequence. For instance, if 9 corresponds 
to the sequence of Fourier coefficients of a function / and aj is of order with s > 0, then 
assuming that 9 G £a,2{R) is equivalent to impose conditions on the s-th derivative of /. The 
belonging to £a,2{R) may be seen as a regularity assumption on our signal. The following result 
characterises the minimax rate of testing over £a,2{R)- 

Proposition 3 Let a, (3 be fixed and denote by p2{£a,2iR),0(, P) the minimax rate of testing 
over £a,2{R) with respect to the I2 norm. Then 

pl{£a,2{R),a,/3) > sup{plAR^aJy^), 

where p'jj has been introduced in Proposition [71 Moreover, for all D £ J, 

sup = 0)<(3, 

where C = C{a,f3) is a positive constant depending only on a and (3 and $q, denotes the test 
introduced in PropositionlM Hence, 

pl{£a,2{R),a,P) < ^n£{Cpl + R\^'), 

Proposition [3] presents both an upper and a lower bound for the minimax rate of testing over 
£a,2{R)- Remark that the upper bound is attained by the test <I>q introduced in Proposition [2] 
where only signals with a finite number of non-zero coefficients were considered. We do not use 
the whole sequence {Yj)j^j in order to test the null hypothesis "6* = 0" but only the first D 
coefficients. The price to pay is to introduce some bias in the testing procedure. However this 
bias can be controlled by taking advantage of the constraint expressed on the decay of 9. 

A good characterization of p2{£a,2{R)-,Oi, j3) can be obtained as soon as the lower and upper 
bounds are of the same order. As many statistical problems encountered in the literature, one 
has to find a trade-off between the bias R?a~j^ and some kind of variance term p^j^. This trade- 
off can be performed in several situations, hence leading to explicit rates of convergence: see 
Corollary [3] below. 

Proposition [3] presents the minimax rate of testing in a general setting. Several explicit rates 
can be obtained when introducing specific constraints on the sequences {ak)keJ and {bk)k&j- 
These rates are summarized in Corollary [3l Let {i'k)ken* be a sequence real numbers. In the 
following, we write Uk ~ A;' if there exist positive constants ci and C2 such that, for all k G N*, 
cik^ < Uk < C2kK 

Corollary 3 Let a, 13 be fixed. We assume that J = N* and {Zj)j^j obeys to Model /il.3\) . The 
table below presents the minimax rates of testing over the ellipsoids £a^2{R) with respect to the I2 
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norm. We consider various behaviours for the sequences {ak)keN* o,nd {bk)keN*- Por each case, 
we give f{a) such that for alll > a > 0, Ci{a, j3)f{a) < p'^{£a,2{R),C(, (3) < C2{a,l3)f{a) where 
Ci(a,/3) andC2{a,l3) denote positive constants independent of a. 



Mildly ill-posed 



Severely ill-posed 

bk ~ exp(-7F) 




(log(^-^))"'' 



{s < 1) 



Here D denotes the integer part of the solution of pj^ = B? 



a 



-2 

■D 



These rates have aheady been presented in the Uterature. The case Ok ~ k^ and bk ~ k~ was 
first studied in [ll]. More recently, |12] deals with other cases. Similar rates are also available 
in [6] in the context of density estimation with errors in the variables. The aim of Corollary [3] 
is to show that our approach can lead to important minimax results. Our point of view in this 
paper is indeed entirely non-asymptotic and is not restricted to ill-posed inverse problems. 

Concerning severely ill-posed problems with supersmooth functions (i.e. bk ~ exp(— 7/0^) 
and Uk ~ exp{i'k^)), we do not handle the general case since we assume that s < 1. When this 
assumption is violated, the upper and lower bounds in Proposition [3] do not coincide: our test 
does not attain the minimax rate of testing. This is certainly due to our approach, which in 
some sense is related to a rough regularization scheme. We mention [12] for a complete study 
of this case. 

4.2 Minimax rates of testing over /^-bodies with < p < 2 

Ellipsoids contain essentially smooth functions. In the particular case where 9 corresponds to 
the Fourier coefficients of a given function /, the constraints expressed through the belonging 
to one of the spaces introduced above may be incompatible with the presence of discontinuities. 
In order to extend the covered cases, we consider in this subsection sequences 9 belonging to 
/p-bodies £a,p{R) defined as 



where a = {ak)keJ denotes a monotone non-decreasing sequence and < p < 2. The following 
theorem proposes a lower bound for the minimax rate of testing over such spaces. 

Theorem 3 Let {Yj)j^j obey to Model U.l\) . Let q,/3 be fixed and denote by p2{£a,piR),Oi, f3) 
the minimax rate of testing over £a,piR) with respect to the I2 norm. For all D ^ J and for all 
0<1<D - \VD], we set 





where 



is given in \2. 5|) . Then 



p\£a,p{R),a,l3) > sup {pi{D)\/ p2{D)), 
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where 



and 



/)i(L')= max R ao ^ ^PfVD]Di 




To the end of this section, we assume that the sequence {bj)j(zj is polynomially or exponen- 
tially increasing, which yet correspond to the main case of interest in inverse problems. The 
lower bounds in this particular setting are easier to handle, as proved in the following corollary. 

Corollary 4 Let (Zj)jgN* obey to the Model \1.3\) with bj = craj^ for all j G N*. Then, 

assuming that for all j £ , aj = p or that for all j G N*, (Tj = exp{'yj) for some 7 > 0, we 
obtain ^ 

p'^{£a,p{R),a,l5) > C(7) sup(/j^^^^^ A \/D ^R^a'j^) := pl^p^^, 

where ^ defined in i2. 6\) . 

In order to attain the lower bound presented above, a test similar to the one introduced in 
Proposition [2] is not sufficient. On Zp-bodies, the bias after a given rank D is indeed more 
difficult to control than for ellipsoids. Some significant coefficients (in a sense which will be 
precised in the proof) may be contained in the sequence 6 after the rank D. Hence, we have to 
introduce specific tests in order to detect these coefficients. 
More precisely, for all j E J and a G (0, 1), introduce 

where qj^a denotes the 1 — a quantile of a Gaussian random variable with mean and variance 
a'j. Then define 

= ^1oc,q/2 a $Dt,a/2 with ^'ioc,a/2 = SUp ^{j},3a/7r2 (j-Dt)2 , 

je{D'r+l,..N} 

where ^D^,a/2 denotes the test constructed in Proposition [2] and 

= inf {d G j, R'a-^'VD'-'^' < . (4.12) 

By convention, D"^ = N if J = {1, . . . , N} and the set in ()4.12p is empty. The following theorem 
emphasizes the performances of the test 

Proposition 4 Let a,/3 be fixed. We assume that the sequence (a-^bi is monotone 

non-increasing. Suppose that J = {1, . . . , N}. Then 

sup Pe($i = o)</?, 

ee£-a,p(i?),||e!|2>A,p2 



with 
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• Ao- = Clog(A^) for mildly ill-posed problems, i.e {aj)j^j ~ for t > 0, 

• Ao- = Clog(A^)\/^ for severely ill-posed inverse problems, i.e. {aj)j£j ~ {e^^)j^j 
for 7 > 0, 

where C denotes a positive constant independent of a. 

Our test reaches the lower bound estabhshed in Proposition [3] up to a log term, it is not 
sharp. Nevertheless this drawback is not characteristic of the heteroscedastic model since a 
similar property occurs in the homoscedastic case: see [2] for more details. Hence, the lower 
bound established in Proposition [3] corresponds certainly to the minimax rate on £a^p{R). 

For the sake of convenience, the upper bound is presented for J = {!,..., A^} which, roughly 
speaking, corresponds to the regression setting. Nevertheless, our result can be easily extended 
to the case where J = N*. In such a situation, our test will be performed on {1, ... , A^}, where 
is a trade-off between the bias after the rank N on £a,p{R) and the growth of log(A^). A good 
candidate for TV is a power of a~'^. 

In order to conclude this discussion, we point out that we impose a condition on the sequence 

(aj^b,: '•^ . This condition is necessary in order to control the bias after the rank . It 

\ J J ) jgp^ 

always hold when p = 2 since (aj)jgN is an increasing sequence. When p < 2, the considered 
function has to be sufficiently smooth with respect to the ill-posedness of the problem. A similar 
condition can be found for instance in [8]. 

The corollary below deals with the particular case of mildly ill-posed problems with polyno- 
mial /p-bodies, where the situation is easier to handle. 

Corollary 5 Assume that ~ and b^ ~ k^^ for all k £ N* where s,t denote positive 
constants such that s > t{2/p — 1). Then 

43+2/p-l 4s+2/p-l 

C2l0g(A^)fj2''+2«+i/p > p'^{£a,p{^),a,j3) > CiO-a^+at+i/P, 
where Ci,C2 denote positive constant independent of a. 

Remark that the sequence (a - ^b- is monotone non-increasing as soon as s > t(2/p — 

1). Hence the conditions of Proposition U] are satisfied. The proof follows the same argument as 
in Corollary O 

5 Proofs 

5.1 Proof of the lower bounds 

The proofs of the lower bounds use a Bayesian approach extending the methods developed in 
the papers by Ingster [11] and by Baraud [2j. We use the following lemma : 

Lemma 1 Let T be some subset ofl2{J). Let /Xp be some probability measure on 

Tp = {e(^T,\\eh>p} 



12 



and let 

Assuming that P^^ is absolutely continuous with respect to Pq, we define 

For all a > 0, /3 g]0, 1 - a[, if 

Eo < 1+4(1 -a-/3)2, 

then 

yp<p*, inf sup P0($„ = 0) > /3. 

This implies that 

p{T,a,f3)>p*. 

For the proof of this lemma, we refer to Baraud [2], Section 7.1. 



5.1.1 Proof of Proposition [T] 

Let p > 0, we set for 1 < j < D, 



D 



-1/2 



9j = ujja]p \^ a] 

where (wj, 1 < j < -D) are i.i.d. Rademacher random variables : P(a;j = 1) = P(cijj 
Let pp be the distribution of (^i, . . . , ^d). /ip is a probability measure on 



-1) = 1/2. 



{Q^SDAeh = p) 



Let us now evaluate the likelihood ratio L^^ (Y) 



exp 



exp 



2 — 



p' Lj=l 
9 v^-D 4 



cosh 



exp 



Let Z be some standard normal variable. For all A G M, 

E(cosh2(AZ)) = exp(A2) cosh(A2). 



(5.13) 
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Hence, since Yjjoj ~ AA(0, 1) 



Eo (L2^(y)) =n cosh 



Since for all x G M, cosh(2;) < exp(x^/2), we obtain 

1^0 (L^,(n)<exp f^^^ 



For p = we obtain : 

Eo <l + 4(l-a-/3)2, 

which implies that p{Sd,0(, /3) > pn by Lemma[TJ 

5.1.2 Proof of Theorem [T] 

Without loss of generality, we can assume that the sequence ((Tj)jgj is non decreasing (if this 
is not the case, we can reorder the observations 1^). We fix some I £ {0, 1, . . . ,n — k}. Let 
■M-k^i^n denote the set of all subsets of {Z + 1, . . . , n} with cardinality k. Let m be a random set 
of {I + 1, . . . , n}, which is uniformly distributed on M.k,i,n- This means that for all in G l n? 
P(m = m) = 1/C^_i. Let (a;j,l < j < n) be i.i.d. Rademacher random variables, independent 
of 771. Let us recall that 

l+k 

= E -I- 

j=i+i 

We set 

9j = (pujjaj/^i^k) Ijem (5.14) 
Note that 9 = {6j)j^j G Sk^n and that, since {aj)j^j is non decreasing, 

Irlb - P — ^ P • 



E, 



E, 



exp 



exp 



E, 



4E 



1 y2 



I Yj^jP \ I kp^ 



j^l,k 



^^Ik 
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7^ E 

n-l m€Mk,Ln 



exp Yl 



exp 



exp 



^ E n 



We use ()5.13p together with E(cosh(AZ)) = exp(A^/2) for a standard Gaussian variable Z. Since 
Yj/cTj is a standard normal variable, we obtain that 



1 



exp 



X Y\ 6xp 



^2 \ 



E n 

( ^2 



y2 



cosh 



y2 



Since, for all m, m' G Mk,i,n, 

\m\m'\ + |m'\m| + 2\m Ci m'\ = \m\ + \m'\ = 2k, 

we obtain 



k \2 



E 



cosh 



m,m'eMk,l,n 



\mr\m'\ 



The end of the proof is similar to the proof of Theorem 1 in Baraud \2\ , similar arguments are 
also given in Fromont et al. [9]. Let us recall these arguments. 



Eo (L2^(y)) =E 



exp \rh n m \ In cosh 



y2 
^Lk 



where rh,rh' are independent random subsets with uniform distribution on Mk,i,n- For fixed 
m, |m n m'l is an hypergeometric variable with parameters {n — l,k,k/{n — I)). We know from 
Aldous p] that there exists a binomial variable B with parameters {k,k/{n — I)) and a a— 
algebra B such that K{B/B) = |m n m'|. By Jensen's inequality, 



Eo{Ll^iY))<E 



exp i? In cosh 



^Ik 



Since S is a binomial variable with parameters {k, k/{n — /)), 



E 



exp B In cosh 



y2 

Uk , 



exp 



A; In 1 + 



n — / 



cosh 



y"2 

l,k. 



Let c = 1 + 4(1 — a — and A = ln(c). Since the function cosh is increasing on we 
obtain that if 



^Lk 



< In f 1 + ^ + ^2A + aA , 
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then 

cosh 

We finahy obtain that 

Eo [LfjY))<exp 
By Lemma [H this imphes that 



-1 



A. 



fcln 1 + 



n — I 



A 



< c. 



p{Sk,n,a,(3) > S2,ln(l + A + + 
> In (l + 2A\/ V2Tj . 
If a + /3 < 0.59, ln(c) > 1/2, which imphes that 



p(5fc,„,Q,/3) In 1 + 



Since this result holds for all I G {0, n — k}, we get 



n — I n — I 

V 



A;2 



fc2 



/9(5fc,„, a, (3) > max ^, In 1 + 



n — I n — I 



0<l<n-k 



A;2 



V 



/fc2 



In order to prove that 



1/2 



p{Sk,n,a,(3) > X] 



we define, as in the proof of Proposition [H 



2 / V^" 4 





'1/2 



Vj € {n — /c + 1, . . . , n}, 
Vj ^ {n-/c + l,...,n}, 



where {u}j,n — k + l<j<n) are i.i.d. Rademacher random variables. Note that {9j)j^j G Sk^n 
a nd that \\e\\l = p^. We now conclude as in the proof of Proposition [H using that C{a,l3) = 
V2Mc}>l. 

5.2 Proof of the upper bounds 
5.2.1 Proof of Proposition [2] 

In order to prove Proposition [2l we have to show that for all 6 S So such that \\0\\2 > 



D 



Y^Yf <tD,i-c.{<y) I </3. 



(5.15) 
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We denote by tD^^, cr) the /3 quantile of T.f=i Yf , when Y = {Yj)j(,j obeys to Model ([□]). In 
order to prove (I5.15|) . it suffices to show that 

tD,l-aicr) < tD,i3iO,cr)- 

To prove this inequahty, we wih first give an upper bound for and then a lower bound 

for tD,i3iO,cr)- 

Upper bound for t^) i_q((t) : 

We use an exponential inequality for chi-square distributions due to Laurent and Massart |15] 
(see Lemma 1). It follows from this inequality that for all x > 0, 

cy]ie] -\)>2^\Y,(y]\ + 2x sup [a]) I < exp(-x). 



Setting Xq, = ln(l/a), we obtain that 



Since supi<j<£, cr] < \ ajj 

tAi-a(cT)<^c7| + C(a)K;an . (5.16) 
i=i \i=i / 

Lower bound for t£)^j3{6,a) : 

We prove the following lemma, which generalizes the results obtained by Birge [1] to the het- 
eroscedastic framework : 

Lemma 2 Let 

Yj=ej + ajej, l<j<D, 

where ei, . . . eo are i.i.d. Gaussian variables with mean and variance 1. 
We define f = Y,f=i Yf and 

3=1 i=i 
The following inequalities hold for all x > : 

P f f - E(f ) > 2\/S^ + 2 sup (cT^)x) <exp(-x). (5.17) 
¥(t- E(f ) < -2\/S^) < exp(-x). (5.18) 



17 



The proof of this lemma is given in the Appendix. 

Inequahty ()5.18p provides a lower bound for to^piO-, c)- Indeed, setting xp = log(l//3), we obtain 
that 



r-E(r) < -2v/s^ < /3 



Hence, tj:)^p{e,a) > Ej=i(^j + (^j) - "^^/^p- is satisfied if tD,i-a{(^) < tD,p{0,a), which 

holds as soon as 



D 



D 



, V^O-^ + 2Xa sup ((T^) 



(5.19) 



Let us note that 



D 



D 



D 



2^2 



< 



D 



X Vcr^ + \/2 sup (cTj), 



Hence, the following inequality implies (|5.19|) : 

D 



^6'|-2\/2 sup {aj)^/x^ 



l<j<D 



D 



D 



o-j (a/^ + \/^) ~ 2 sup (cr|)xa > 0. 



l<j<D 



This inequality holds if 

D 



sup {aA{xi3 + Xa) + 

i<i<i5 



D 



where C is an absolute constant (which can be taken equal to 8). Hence, we have proved that 

p{SD,a,/3) <C{a,/3), 
which concludes the proof of Proposition [21 



5.2.2 Proof of Theorem M 

The test is obviously of level a thanks to Bonferroni's inequality : 

Po(^a = l) < Po($ii = l)+lPo(<% = l) 



./2 

a a 
< - + - < a. 
- 2 2- 



Let us now evaluate the power of the test. 



M<^^ = 1) > max (nC^il = 1) + = 1)) • 
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It follows from Proposition [2] that for all 9 G Sk^n such that 

1/2 



n 



we have Pe($a^ = 1) > 1 - /?. It remains to evaluate the power of the test ^^^^j^. 

= 0) = Pe(Vi G {1, . . . , n}, ^ < g„,i_„) 



-I 



< Jrif Pe(^ < 



y2 

< P {\ej\ - aj\ej\ < ajy/qn,i-a) 

< P {crj\ej\ > \9j\ - aj^qn,i-a) ■ 

Let qp denote the 1 — /3 quantile of \ej\. We obtain that if 

3i G \ej\>aj{qi3 + ,/q;;j::^), (5.20) 

then 



m^^a^ = 0) < /3. 

Condition (15.20p is equivalent to 



3m G Mk,n, >Y1 + 

n,l— o J 



In particular, if 



then ()5.20p holds. This implies that for all 9 G Sk^n such that 

\\9\\l > max E + 

we have P0(^>k ' = 0) < /3. It remains to give an upper bound for q^^i-a- We use the inequality 
lP(|ei| > x) < exp(-xV2). This leads to 



P( max e- > 21n(n/a)) < nP(|ei| > ^21n(n/a)) 

l<j<n 

< a. 

Hence, qn,i-a < 21n(n/a), which concludes the proof of Theorem [2j 
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5.2.3 Proof of Corollary H 

1/2 



Since \\e\\l > ||(9||^ > C'^{a,l3) (Y.]=i(^j) > we obtain from Theorem [2] that 

= 0) < /3. 

We obtained in the proof of Theorem[2]that if (lOOl) holds, then Pe($i^^ = 0) < /3. Since qn, i-a < 
21n(n/a), we obtain that there exists a constant C{a, (3) such that if ||^||oo > C(q, (3)ai^n) \/ln(n), 

then Pe($i% = 0) < /3. 

5.3 Proof of minimax rates on ellipsoids and /p-bodies 
5.3.1 Proof of Proposition [3] 

We first prove the lower bound. For all D G J, introduce rj-, = pj^ A E?aj^ . Let D be fixed. 
Then for all 6 £ Sd such that \\6\\2 = rj^ 

D 

Y,ap] = Y,a]9]<al\\9\\l<R\ 

Hence 

{e G Sd, \\0\\l = rl} C {0 G SaMR), \ml > rl} . 
Since ru < pu, we get from Proposition [1] 

inf sup P0($a = O)>inf sup = 0) > /3, (5.21) 

e6£-„,2(i?.),j|e||2>ri3 ee5i3,||e||2=ri3 

where the infinimum is taken over all possible level-a testing procedures. Since inequality (j5.2ip 
holds for all G J, we obtain p'^{£a^2{R),Oi, 13) > sup£)gj(/?^ A R^a^). Concerning the upper 
bound, we know from Proposition [2] that the test $a is powerful as soon as: 



D 

2 
ki 



> C{a,(3)pl ^ \\9\\l > C{a,(3)pl + J] 

j=l k>D 



where C(a,/3) denotes a positive constant. Since 9 G £a,2{R), we get 

^9l< R^a-^^ and sup = 0) < /3, 

k>D ee£-„,2{H),||e!P>cp2,+i?2a-2 

where C = C{a,f3). This concludes the proof since the previous result holds for all D £ J. 

5.3.2 Proof of Corollary [3] 

First case: ~ and ~ /c^*. Choosing 

2 

D = (J4s+4t+l , 
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we can remark that and R^clq are of the same order, hence leading to the desired rate. 
Second case: ~ e'^^' and h}~ ~ k"*. Set 



1 \ V« 

,-log(.-) 



Then 



pi(f„,2(i?),«,/3) < Cpl^^+R\-j,l, 
where C denotes a constant independent of a. Concerning the lower bound, we set 



2^^(2t+l/2)/s 



4i/ 



log(a-2) 



1/. 



Then 



> Ca' (log(c7-^)) Aa = Ca' (log(a-^)) ^''^'^'^Z' , 

for some C > 0. 

Third case: Uk ~ and 6^ ~ e~^'''" . Set 



l/r 



Then 



pi{Sa,2{R),a,P) < p'D.+R'a^l, 



< ^^a%-rl + R\ll, 



Do ^ "Do' 

_2N\-2s/r ^ , i \ /n / -2\\-2s/r- 



< a + C(log(a-2)) '^/'■<(C + l)(log(a-2)) 
for some C > 0. Concerning the lower bound, we set 



Then 



1 \ lA' 



pi{£a,2{R),a,l3) > pl^AR^a-^l 



> a%lAR\-^l 



> lAC(log(c7-2)) '^/'• = C(l0g(O) 



-2s/r 
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for some C > 0. 



Fourth case: au ~ e*^^^ and bk ^ e '^^'^ . Denote by D the solution of the equation 

„2 _ r>2~2 

- K . 

Remark that 

pl,<R^a-^l where 1^0= L^J, 
since {p\))D&i* and (a^)i:)eN* are monotone increasing. Hence 

pl{8am,c^,P) < CaV^^S +e-2-^o < (C + l)e-2-^o. 

Then 

PDi > -R^aBi where Di = \D] . 

We get: 

pli£a,2iR),a,f3) > pI^ a R'a-^l > R\-^\ > R^-^'^l 

In order to conclude the proof, we have to prove that the lower and upper bounds coincide. To 
this end, remark that Di = Dq + 1. Thus 

for some constant C as soon as s < 1. 
5.3.3 Proof of Theorem [3] 

The proof will use the one of Theorem[TJ We assume that ((Tj)jgj is non-decreasing. Let us first 
establish a relation between the Ip ball 8a,p{R) and the sets Sk^n- For all D £ J, for all 6 G ^ 

such that \\e\\l <^/D~ '^R^ CLj^i we have 9 G £a,p{R)- Indeed, using Holder's inequality 

+ 00 

E"?."" = «?i < {VD)'-^^/\\\efr/'^^ < Rp. 

j = l j:e,^o 

We set k = \^/D], n = D and for aU / G {0, 1, . . . , n — fc}, we define = {9j,j G J) by 
(|5.14p . As pointed out in the proof of Theorem [H 6 G Sk^n and 116*112 > p^- We also have 
116*111 < p'^^l-k,k/^lk- This implies that if 

^i,k 

then 9 G £a,p{R)- 

Moreover, in the proof of Theorem [H we proved that if 



9 „o . I n — I n — I 



then 



Eo(i.^,(n)<l + 4(l-a-/3)'. 
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This implies by Lemma [T] that p2{£a,p{R)) > P^- We finally get 

\ J n—k,k 

Since the result holds for all / G {0, 1, . . . , n — A:}, we obtain that p2{£a,p{R)) > Pi{D). To obtain 
that p2{£a,p{R)) > P2{D), we consider, as in the proof of Theorem [U for k = \y/D] and n = D 

( \-i/2 
dj = '^jf^lp (Ej=n-fc+i j Vj e {n-/c + l,...,n}, 

= Vj ^ {n- + 

Since p|(£:a,p(i2)) > pi(-D) V p2(-D) for all L> G J, the result follows. 

5.3.4 Proof of Corollary [4] 

For the sake of simplicity, we assume that \fT) is an integer. We derive from Comment 2. of 
Theorem [1] that when Uj = j^, then 



Pi{D) > C(7) '^'i?2^^2 ^ 



Moreover, 



„ < C max < C y j^T 
j=D-VD+l 

When (Tj = exp'''-', we have from Theorem [3] that 

pl{£a,p{R)) > SUpp2p). 

DeJ 

Moreover, P2{D) > \fl^ "^^^ B?a^ A o"|). We conclude by noticing that, in this case, we also 
have that /o^^ < C'{j)a'j-,. 

5.3.5 Proof of Proposition [4] 

It follows from Bonferonis's inequality that is a level-a test. Introduce 

A = {DeJ, R'a-^'^'-'^' < pf^^,^} . 
In a first time, we suppose that A is empty. From the definition of D^, we get D'f = N and 

Pei^i = 0) < Pe(^Dt,«/2 = 0) = Pe{'^N,a/2 = 0) < /?, 
for all sequence 6 satisfying 
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for some positive constant C. Since A is empty and using the comments following Theorem [T] 

Hence, our test is powerful as soon as > PapR- 

From now on, we assume that the set A is not empty: < N. For all j S J, set 



Two different situations may occur: 

1/ For all j > D\ < cr^ p'j for some sequence /ij, i.e. all the coefficients Ok have poor 
importance after the rank D'^ . 

2/ There exists at least j G {D'^,...,N} such that > cr^/^j, i.e. there exist significant 
coefficients after the rank . 

First consider the case 2/. Recall that in this case, the set A is not empty and there exists 
/ G {D'^,...^N} such that > this particular setting, we have to use the 

threshold test in order to detect these coefficients. More precisely, 



Pe{^a = 0) < Pe 1^ sup^ ^{j},3a/7r2(j-Dt)2 = j <Pe (^'{/},3a/7r20'-Dt)2 = oj . 
Thanks to inequality (29) of [2j, we know that this probability is smaller than /3 as soon as: 

This is exactly the assumption made in case 2/. 
Now, we consider point 1/. Let j > D\ 



Then, we get 



<r rr'^-P \" ^PflP«-PA~(2-p) 2-p 



j>Dt 



< cT^-PpPmaxa-P^-^'-^Vv • 



Since the sequence {o-j^bj ^"^ ^^)jeN is assumed to be monotone non increasing, we can control 
the bias as follows 

j>Dt 
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In order to conclude the proof, we have to bound the right hand side of the above inequahty. 
First assume that the problem is mildly ill-posed, i.e. {bk)keN* ~ (fc~*)feeN for some t > 0. Then 

= inf{l)G J,i?2a^2(^)l-2/p<^2^2m/2|^ 

= infj^G J,i?2a^2<^2^2m/p|_ 

Thus 

J2 ^] < CT2-VP(Z)t)<P+l/2(Z)t)2t-tp^2-p ^ ^2pt)2t+l/2^^-p_ (5 22) 

j>Dt 

Hence, we deduce from Proposition [2] that 

P/(^l = 0)<Pe(^Dt,«/2 = 0)</3, 

for all sequence 6 satisfying X^j^i ^| — C'a,/3C7^(-D^)^*"*'^/^, which is equivalent to 

\\ef > C^,pa\D^f'+^'^ + d]. (5.23) 

j>Z3t 

The first point of Proposition H] follows from (j5.22p and (j5.23p . Now assume that the problem 
is severely ill-posed, i.e. (6fc)fceN* ~ (e~'^'^)fceN* for some positive constant 7. In this setting, 

= inf {z? G J,i?2a^2^'"''/' < a^e^^^} . 

Hence, 

j>Dt 

An inequality similar to (|5.23p holds, which concludes the second point. 



6 Appendix 



Proof of Lemma [2] : 

We first compute the Laplace transform of T. Easy computations show that for t < l/(2cr|), 



This implies that for t < mini<j<D l/(2o"|) 



: exp 



l-2ta] V-'^^^j 



D 



E 



exp(tT) 



exp 



D 



^ 1 - 2ta^ n 



• 1 ' 2ta2 



1 Jl-2ta] 



Moreover, 
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This leads to 



E 



exp(t(f -E(r)) 



exp 




2*2 


92.2 


1 - 


2*.2 


exp 


'i 


2*2 


92.2 


1 - 


2*^2 



D 



'-in 



1 



ijl- 2*^2 



D 



ta] exp -^log(l-2*a2) 



We use the following inequality which holds for x < 1/2 : 

ilog(l-2x) + x + ^ 
This inequality implies that for all * < mini<j<£) l/2a'j, 



> 0. 



^ *2^ 

2*^7^ 



logE [exp(*(f - E(f))J < + 2*2 X; ^5^, 



This leads to 



logE 



exp(*(r - E(r)) 



< 



*2S 



1 - 2*SUPi<j<j)(c72)' 



We now use the following lemma which is proved in Birge [4j (see Lemma 8.2) : 
Lemma 3 Let X be a random variable such that 



log(E[exp(*X)]) < 



Mi 

1-6* 



forO<t< 1/6 



where a and 6 are positive constants. Then 

F{X > 2a + bx) < exp(— x) for all x > 0. 



Hence, inequality (|5.17p is proved. Let us now prove inequality (IS.lSp . 
For all z E M, 



r-E(r) < -z 



We have, from the above computations 



D 



ln(E (e*(^-K(^)+-))) =^ 



-f + E(f) - z>0 



< infEfe*(-^+^(^)-^) 
t>0 V 

< infEfe*(^-^(^)+^)" 
t<o 



+ *z. 



We now use (j6.24p for x = ta'j with * < 0. We obtain 



*2^ 



-ln(l-2*.,^)+*.^^ + ^-^<0 



5.24) 
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This implies that 



^—^<-2te]-^Hl-2ta]). 



Hence, for all t < 0, z G 



D 



-log(l-2ta|)+tan 1 + 2-^ + 



tz 



We use this inequahty with z = 2-v/Sx, and = — 

P (f - E(f ) < -2^/S^) < E (^e*^(^-E(f)+2v^)^ _ 

Moreover, 

D 



E e 



,t,(T-E(T)+2v/SS) 



exp 



'^9 



2x 



We use the following inequality which holds for all u > : 

^log(l + 2u) -u> -n^, 
and we apply this inequality to u = —t^c^'j. We obtain that for all a; > 0, 

- E(f ) < -2\/s^) < exp(-x). 
This concludes the proof of Lemma [2j 
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